Machine Learning for Information Extraction
نویسنده
چکیده
As an increasing amount of information becomes available in the form of electronic documents, the need to intelligently process such texts makes shallow text understanding methods such as Information Extraction (IE) particularly useful. IE has been restrictedly defined by DARPA's MUC program [MUC Proceedings] as the task of extracting specific, well-defined types of information from text in restricted domains and filling pre-defined template slots. MUC has inspired a huge amount of work in IE and has become the major reference in the field. A typical IE tasks is illustrated by the example in Figure 1 from the MUC-4 corpus that describes terrorist incidents. Even as such it is still a challenging task to build an efficient IE system with good recall (coverage) and precision (correctness) rates.
منابع مشابه
Machine learning based Visual Evoked Potential (VEP) Signals Recognition
Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...
متن کاملUsing Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media
Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...
متن کاملComparative Analysis of Machine Learning Algorithms with Optimization Purposes
The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches. Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data. In this paper, a methodology has been employed to opt...
متن کاملA Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection
Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...
متن کاملPascal Challenge The Evaluation of Machine Learning for Information Extraction
If the Semantic Web is to utilise the vast number of documents available on the WWW it requires an effective way to automatically annotate those documents, enabling the extraction of relevant information. The Pascal Challenge on the Evaluation of Machine Learning for Information Extraction provided a common basis on which it assess the relative performance of multifarious machine learning syste...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001